SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANTS: 

WARREN, Patrick V. 
SWANSON, Ronald V. 

(ii) TITLE OF INVENTION: 

TRANSAMINASES AND AMINOTRANSFERASES 

(iii) NUMBER OF SEQUENCES: 40 



(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, CECCHI, STEWART & OLSTEIN

(B) STREET: 6 BECKER FARM ROAD

(C) CITY: ROSELAND

(D) STATE: NEW JERSEY

(E) COUNTRY: USA

(F) ZIP: 07068

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: 3.5 INCH DISKETTE

(B) COMPUTER: IBM PS/2

(C) OPERATING SYSTEM: MS-DOS

(D) SOFTWARE: WORD PERFECT 5.1

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: Unassigned

(B) FILING DATE: Concurrently

(C) CLASSIFICATION: Unassigned

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER:

(B) FILING DATE:

(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: HERRON, CHARLES J.

(B) REGISTRATION NUMBER: 28,019

(C) REFERENCE/DOCKET NUMBER: 331400-38

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 201-994-1700

(B) TELEFAX: 201-994-1744


(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 52 NUCLEOTIDES

(B) TYPE: NUCLEIC ACID

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGATTGAA GACCCTATGG AC 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CGGAAGATCT TTAAGCACTT CTCTCAGGTT C 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 52 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGGACAGG CTTGAAAAAG TA 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS 
(A) LENGTH: 31 NUCLEOTIDES

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CGGAAGATCT TCAGCTAAGC TTCTCTAAGA A 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 52 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CCGACAATTG ATTAAAGAGG AGAAATTAAC TATGTGGGAA TTAGACCCTA AA

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CGGAGGATCC CTACACCTGT TTTTCAAGCT C 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 52 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CCGACAATTG ATTAAAGAGG AGAAATTAAC TATGACATAC TTAATGAACA AT 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
CGGAAGATCT TTATGAGAAG TCCCTTTCAA G 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 52 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGCGGAAA CTGGCCGAGC GG 



(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 31 NUCLEOTIDES

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
CGGAGGATCC TTAAAGTGCC GCTTCGATCA A 

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 52 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CCGACAATTG ATTAAAGAGG AGAAATTAAC TATGTGCGGG ATAGTCGGAT AC 

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 31 NUCLEOTIDES

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

CGGAAGATCT TTATTCCACC GTGACCGTTT T 



(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 52 NUCLEOTIDES

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCGACAATTG ATTAAAGAGG AGAAATTAAC TATGATACCC CAGAGGATTA 



(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 31 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 






(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
CGGAAGATCT TTAAAGAGAG CTTGAAAGGG A 

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 52 NUCLEOTIDES
(B) TYPE: NUCLEIC ACID
(C) STRANDEDNESS: SINGLE
(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGAAGCCG TACGCTAAAT AT 

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 31 NUCLEOTIDES
(B) TYPE: NUCLEIC ACID
(C) STRANDEDNESS: SINGLE
(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
CGGAAGATCT CTAATACACA GGAGTGATCC A 

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 1245 NUCLEOTIDES

(B) TYPE: NUCLEIC ACID

(C) STRANDEDNESS: SINGLE
(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: GENOMIC DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

IXi) m« AAG AGA CTG 



(2) 

(i) 



™ *ar r*A CTC AAG TAC AAG CTA AGG 

s s: ss s ss s s s a s. w . ^ - - 

20 

— ?SSSS 2 S £ S S E S 

Arg Glu Oly Glu Asp Val vai 4 5 

Pro Pro Ala bys His He lie Asp ny 6Q 



96 



144 



192

CCG AAC GTT CAC GGA TAT TCT GCG TCA AGG GGC ATA CCA AGA CTG AGA
Pro Asn Val His Gly Tyr Ser Ala Ser Arg Gly Ile Pro Arg Leu Arg
65 70 75 80

AAG GCT ATA TGT AAC TTC TAC GAA GAA AGG TAC GGA GTG AAA CTC GAC
Lys Ala Ile Cys Asn Phe Tyr Glu Glu Arg Tyr Gly Val Lys Leu Asp
85 90 95

CCT GAG AGG GAG GCT ATA CTA ACA ATC GGT GCA AAG GAA GGG TAT TCT
Pro Glu Arg Glu Ala Ile Leu Thr Ile Gly Ala Lys Glu Gly Tyr Ser
100 105 110

CAT TTG ATG CTT GCG ATG ATA TCT CCG GGT GAT ACG GTA ATA GTT CCT
His Leu Met Leu Ala Met Ile Ser Pro Gly Asp Thr Val Ile Val Pro
115 120 125

AAT CCC ACC TAT CCT ATT CAC TAT TAC GCT CCC ATA ATT GCA GGA GGG
Asn Pro Thr Tyr Pro Ile His Tyr Tyr Ala Pro Ile Ile Ala Gly Gly
130 135 140

GAA GTT CAC TCA ATA CCC CTT AAC TTC TCG GAC GAT CAA GAT CAT CAG
Glu Val His Ser Ile Pro Leu Asn Phe Ser Asp Asp Gln Asp His Gln
145 150 155 160

GAA GAG TTT TTA AGG AGG CTT TAC GAG ATA GTA AAA ACC GCG ATG CCA
Glu Glu Phe Leu Arg Arg Leu Tyr Glu Ile Val Lys Thr Ala Met Pro
165 170 175

AAA CCC AAG GCT GTC GTC ATA AGC TTT CCT CAC AAT CCA ACG ACC ATA
Lys Pro Lys Ala Val Val Ile Ser Phe Pro His Asn Pro Thr Thr Ile
180 185 190

ACG GTA GAA AAG GAC TTT TTT AAA GAA ATA GTT AAG TTT GCA AAG GAA
Thr Val Glu Lys Asp Phe Phe Lys Glu Ile Val Lys Phe Ala Lys Glu
195 200 205

CAC GGT CTC TGG ATA ATATCKC GAT TTT GCG TAT GCG GAT ATA GCC TTT
His Gly Leu Trp Ile Ile His Asp Phe Ala Tyr Ala Asp Ile Ala Phe
210 215 220

GAC GGT TAC AAG CCC CCC TCA ATA CTC GAA ATA GAA GGT GCT AAA GAC
Asp Gly Tyr Lys Pro Pro Ser Ile Leu Glu Ile Glu Gly Ala Lys Asp
225 230 235 240

GTT GCG GTT GAG CTC TAC TCC ATG TCA AAG GGC TTT TCA ATG GCG GGC
Val Ala Val Glu Leu Tyr Ser Met Ser Lys Gly Phe Ser Met Ala Gly
245 250 255

TGG AGG GTA GCC TTT GTC GTT GGA AAC GAA ATA CTC ATA AAA AAC CTT
Trp Arg Val Ala Phe Val Val Gly Asn Glu Ile Leu Ile Lys Asn Leu
260 265 270

GCA CAC CTC AAA AGC TAC TTG GAT TAC GGT ATA TTT ACT CCC ATA CAG
Ala His Leu Lys Ser Tyr Leu Asp Tyr Gly Ile Phe Thr Pro Ile Gln
275 280 285

GTG GCC TCT ATT ATC GCA TTA GAG AGC CCC TAC GAA ATC GTG GAA AAA
Val Ala Ser Ile Ile Ala Leu Glu Ser Pro Tyr Glu Ile Val Glu Lys
290 295 300

ACC GCA AAG GTT TAC CAA AAA AGA AGA GAC GTT CTG GTG GAA GGG TTA
Thr Ala Lys Val Tyr Gln Lys Arg Arg Asp Val Leu Val Glu Gly Leu

305 1008 

Asn Arg Leu Gly Trp y 33Q 33* iQ56 

325 m , aTfi AAC TCT CTG GAC TTT l<-v- 

340 11Q4 

-2SSSSSSSSSESSSS 

asssasssssssaissss 

385 390 1245 

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 1122 NUCLEOTIDES
(B) TYPE: NUCLEIC ACID
(C) STRANDEDNESS: SINGLE
(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: GENOMIC DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

48

SSSSSSS5WSSSS5S 

20 cG 14i 

« mT » « o- s « - -SJESSSSiSt. 

Pro Asp Leu Glu Pro Ser reo y Q 4 5 

35 19 

SSSSSS2SSSSSS2SS 

65 ^ _ r npjv ACT TCG GGA GCG TTT 21 

-""rssssssss- «y - 

Val Ser Pro Glu Arg Val lie va gQ 

85 - 
CTC GTA GCC TAC GCC GTA ACA CTA AAT GCG GGA GAG AAG ATA ATC CTC
Leu Val Ala Tyr Ala Val Thr Leu Asn Ala Gly Glu Lys Ile Ile Leu

100 

115 

130 

* nr*n ptT CAC ATT TCC 

a £ s s s ss s s k £ s - - ~ s 
Sgassssssissasss 

180 

„ « « « « s «. -c -j « s - gs sss 

" e ^ AAC GOT TTT TCT AAG TAC TTC 

■» ™ iS -c s « « SS £ E£ S S - * ^ - 

Glu Phe Ser Asp Arg Ala lie v 220 

210 

^rpTv rrr rCG GAA GAA CTC 

5 S S 5 2 i S S 5SSS « * - - 

sssssssssssss. 

Val Arg Lys Ala 245- -- 

_ _. r GCT ttt GAT TAC GAG TAT TTG 

275 

rrT pag GGA GCT TTT 

290 

-assssssssssssas 

325 



384 



432 



576 



624 



816 



864 



1056

355 360

1122 

TTA GAG AAG CTT AGC TGA 
Leu Glu Lys Leu Ser End 
370 



(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 1359 NUCLEOTIDES

(B) TYPE: NUCLEIC ACID

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: GENOMIC DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 

20 25 

Arg Lys Tyr He Asp Ala lie Ser Ser Leu ixp y Q 
50 " 

65 70 

GTA GCT CAC ACA ACT ACT CTG GGA AGT TCC AAC GTT CCC GCC ATA CTC
Val Ala His Thr Thr Thr Leu Gly Ser Ser Asn Val Pro Ala Ile Leu
85 90 95

Leu Ala Lys Lys Leu Val Glu He faer e iiQ 
100 lu:> 

„ TAC TCC «C G3T « « £ « -J g « £ JS 

Phe Tyr Ser Glu Asp Gly Ala Glu Ala vax u ^ 
115 "° 

145 150 

GTA GGG GGT ATA GAA CTC TTC CAC GGA ACT TAT AAA GAT CTC CTT TTC 
Val Gly Gly Ile Glu Leu Phe His Gly Thr Tyr Lys Asp Leu Leu Phe

96



144 



192 



240 



288 



336 



384 



432 



480 



528

AAG ACT ATA AAA CTC CCA TCT CCT TAC CTG TAC TGC AAG GAA AAG TAC 576
Lys Thr Ile Lys Leu Pro Ser Pro Tyr Leu Tyr Cys Lys Glu Lys Tyr
180 185 190

GGG GAA CTC TGC CCT GAG TGC ACG GCA GAT TTA TTA AAA CAA CTG GAA 624
Gly Glu Leu Cys Pro Glu Cys Thr Ala Asp Leu Leu Lys Gln Leu Glu
195 200 205

GAT ATC CTG AAG TCG CGG GAA GAT ATC GTT GCG GTC ATT ATG GAA GCG 672
Asp Ile Leu Lys Ser Arg Glu Asp Ile Val Ala Val Ile Met Glu Ala
210 215 220

GGA ATT CAG GCA GCC GCG GGA ATG CTC CCC TTC CCT CCG GGA TTT TTG 720
Gly Ile Gln Ala Ala Ala Gly Met Leu Pro Phe Pro Pro Gly Phe Leu
225 230 235 240

AAA GGC GTA AGG GAG CTT ACG AAG AAA TAC GAC ACT TTA ATG ATA GTT 768
Lys Gly Val Arg Glu Leu Thr Lys Lys Tyr Asp Thr Leu Met Ile Val
245 250 255

GAC GAG GTT GCC ACG GGA TTT GGC AGG ACG GGA ACG ATG TTT TAC TGT 816
Asp Glu Val Ala Thr Gly Phe Gly Arg Thr Gly Thr Met Phe Tyr Cys
260 265 270

GAG CAG GAA GGA GTC AGT CCG GAC TTT ATG TGT CTA GGT AAG GGT ATA 864
Glu Gln Glu Gly Val Ser Pro Asp Phe Met Cys Leu Gly Lys Gly Ile
275 280 285

ACC GGA GGG TAC CTC CCG CTT GCT GCG ACA CTC ACA ACG GAC GAG GTG 912
Thr Gly Gly Tyr Leu Pro Leu Ala Ala Thr Leu Thr Thr Asp Glu Val
290 295 300

TTC AAT GCC TTT TTA GGT GAG TTC GGG GAG GCA AAG CAC TTT TAC CAC 960
Phe Asn Ala Phe Leu Gly Glu Phe Gly Glu Ala Lys His Phe Tyr His
305 310 315 320

GGG CAC ACC TAC ACT GGA AAT AAC CTC GCC TGT TCC GTT GCA CTC GCA 1008
Gly His Thr Tyr Thr Gly Asn Asn Leu Ala Cys Ser Val Ala Leu Ala
325 330 335

AAC TTA GAA GTT TTT GAG GAA GAA AGA ACT TTA GAG AAG CTC CAA CCA 1056
Asn Leu Glu Val Phe Glu Glu Glu Arg Thr Leu Glu Lys Leu Gln Pro
340 345 350

AAG ATA AAG CTT TTA AAG GAA AGG CTT CAG GAG TTC TGG GAA CTC AAG 1104
Lys Ile Lys Leu Leu Lys Glu Arg Leu Gln Glu Phe Trp Glu Leu Lys
355 360 365

CAC GTT GGA GAT GTT AGA CAG CTA GGT TTT ATG GCT GGA ATA GAG CTG 1152
His Val Gly Asp Val Arg Gln Leu Gly Phe Met Ala Gly Ile Glu Leu
370 375 380

GTG AAG GAC AAA GAA AAG GGA GAA CCT TTC CCT TAC GGT GAA AGG ACG 1200
Val Lys Asp Lys Glu Lys Gly Glu Pro Phe Pro Tyr Gly Glu Arg Thr
385 390 395 400

GGA TTT AAG GTG GCT TAC AAG TGC AGG GAA AAA GGG GTG TTT TTG AGA 1245
Gly Phe Lys Val Ala Tyr Lys Cys Arg Glu Lys Gly Val Phe Leu Arg
405 410 415

CCG CTC GGA GAC GTT ATG GTA TTG ATG ATG CCT CTT GTA ATA GAG GAA 1293
Pro Leu Gly Asp Val Met Val Leu Met Met Pro Leu Val Ile Glu Glu
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4,0 

o* c „ „ «= ~ ctx s « s SSSSSS 35 

Asp Glu Met Asn Tyr Val He Asp in ^ s 



1341 



440 

435 1359 



CTT GAA AAA GAG GTG TAG
Leu Glu Lys Glu Val End 
450 



(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 1032 NUCLEOTIDES

(B) TYPE: NUCLEIC ACID

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: GENOMIC DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
ATG ACA TAC ATG AAC AAT TAC GCA AGG J» CCC GTA AAG Jg Val 

Met Thr Tyr Leu Met Asn Asn ±yj- ^ 15 

5 96 



S 52 S = 5 S S S 2£ = = S S S 5S S 

50 55 
65 70 

100 

«-?rsssss;^s |s is s s 

asp Lys Gly Lys Asn Lys Trp uy 125 
115 

- ^ s ~ s g * S S S S 35 £ S S 

His Gly Arg Thr Tyr Gly Ser i,eu ^ 

130 480 

^ -PT tcT TAC GCA AAG CTG * BU 

ssssssssss i s- - - - a 

«■ ^ » -C « « « - « - O^C 0*0 0» - 003 030 

40 



336 



384 



432 



Asn Asp lie Asp Ser Val Tyr Lys Leu Leu Asp Glu Glu Thr Ala oly 

165 170 

_ rnA rTT ATA QAA GGA GAG GGC GGA GTA AAC GAG GCG AGT 

S JS S SS S3 S Gin Gly Glu Gly Gly Val Asn Glu Ala Ser 
180 185 

210 215 
225 230 

260 265 
275 280 

290 295 

"305 310 " " 

S252SSSSS5S5S555 

325 330 



GCT CTT GAA AGG GAC TTC TCA TAA 
Ala Leu Glu Arg Asp Phe Ser End 
340 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 1197 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: GENOMIC DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

ATG CGG AAA CTG GCC GAG CGG GCG CAG AAA CTG AGC CCC TCT CCC ACC 48
Met Arg Lys Leu Ala Glu Arg Ala Gln Lys Leu Ser Pro Ser Pro Thr
5 10 15
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~ ~~r* 7v Ai- rar rTT TTG CGG CAG GGG GAA AGG 

2 S SS S £ JS S 52 3! S K ^ «. «j «- «. 

20 25 

SgSSSSSSSSSSSSSig 

70 

CTT TAC CGC GAC ART Leu Glu S S i£ S SS ?S 

Leu Tyr Arg Asp Asn Gin Leu Glu Tyr ser pro gg 
85 3 

«»-?f£SSS S JS S 2 £S SE 2 £ 

Ser Cys Gly Ala Lys His Ser lie ene ahi 11Q 
100 10b 

« CC ° S 8 2S S! S2 £ S S SJ S Tyr Txp Thr S m 

Asp Pro Gly Asp Glu Val ue e 125 
115 " u 

SSSSlSSSSSSgSESSS 

145 150 
165 

ACA CGC ACC « TAC CGC CCG OAS CAA CTT ATC « » « ™ & 

Thr Gly Thr Val Tyr Arg Arg Glu Glu Leu y igQ 

180 1BS 

SSS23SSSS5SSSSS 

195 200 

SS5SS2SS5SSS52 

* 230 2 " 

ACC CGT TCC gC ATA C3T TAT OCT OCC CCT CCC « « «J 

Thr Gly Trp Arg He Gly Tyr Aia ax 255 
245 

GCC CAG GCC ATG ACC AAC CTC CAA AGC CAC AGT ACC TCT AAC CCC ACT
Ala Gln Ala Met Thr Asn Leu Gln Ser His Ser Thr Ser Asn Pro Thr



GCC CTG 
Ala Leu 



CTG ATC 
Leu He 
210 

GAG GTC 
Glu Val 
225 

GCC ATG 
Ala Met 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816

260 265 270



sssassssssasssaa 

275 28 

~n nnr rrr TTT CAA AAG CGG CGG GAT TTC ATC 
CCG GTG GAG AAC ATG CGC CGG GCT TTT CAA AA ^ ^ phe Jle 

Pro Val Glu Asn Met Arg Arg Ala Ptie em y ^ 
290 ^ S 

- - « T 2£ S 25 S 5SSS S £ - ~ 

Trp Gin Tyr Leu Asn Ser i,eu *^ jr 320 
305 31 

5S2SE2SSESSSSSSS 
52 £ 5 S S S S £ S S =S = S2 = 2£ 

ieu Olu Glu lie Lys val Ala Thr val Ala y ^ 

sssssssassssssa 



864 



912 



960 



1008 



1056 



1104 



1152 

\ 

\ 

1197 ^ 



385 390 



(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 1779 NUCLEOTIDES

(B) TYPE: NUCLEIC ACID

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: GENOMIC DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:



S5SSSSS5S2tS2J25 = SSS2a 

a tv a rrr CTA TGG GGA AAG GAT TAC AAG 

£ £ a £ as 2 1 - ^ 

50 



48 



96 



144 



192 



GCT AAA ACG GGT ATA GGT CAC ACA CGC TGG GCA ACC CAC GGA AAG CCC 240
Ala Lys Thr Gly Ile Gly His Thr Arg Trp Ala Thr His Gly Lys Pro
65 70 75 80

ACG GAC GAG AAC GCC CAC CCC CAC ACC GAC GAA AAA GGT GAG TTT GCA 288
Thr Asp Glu Asn Ala His Pro His Thr Asp Glu Lys Gly Glu Phe Ala
85 90 95

GTA GTT CAC AAC GGG ATA ATA GAA AAC TAC TTA GAA CTA AAA GAG GAA 336
Val Val His Asn Gly Ile Ile Glu Asn Tyr Leu Glu Leu Lys Glu Glu
100 105 110

CTA AAG AAG GAA GGT GTA AAG TTC AGG TCC GAA ACA GAC ACA GAA GTT 384
Leu Lys Lys Glu Gly Val Lys Phe Arg Ser Glu Thr Asp Thr Glu Val
115 120 125

ATA GCC CAC CTC ATA GCG AAG AAC TAC AGG GGG GAC TTA CTG GAG GCC 432
Ile Ala His Leu Ile Ala Lys Asn Tyr Arg Gly Asp Leu Leu Glu Ala
130 135 140

GTT TTA AAA ACC GTA AAG AAA TTA AAG GGT GCT TTT GCC TTT GCG GTT 480
Val Leu Lys Thr Val Lys Lys Leu Lys Gly Ala Phe Ala Phe Ala Val
145 150 155 160

ATA ACG GTT CAC GAA CCA AAC AGA CTA ATA GGA GTG AAG CAG GGG AGT 528
Ile Thr Val His Glu Pro Asn Arg Leu Ile Gly Val Lys Gln Gly Ser
165 170 175

CCT TTA ATC GTC GGA CTC GGA GAA GGA GAA AAC TTC CTC GCT TCA GAT 576
Pro Leu Ile Val Gly Leu Gly Glu Gly Glu Asn Phe Leu Ala Ser Asp
180 185 190

ATT CCC GCA ATA CTT CCT TAC ACG AAA AAG ATT ATT GTT CTT GAT GAC 624
Ile Pro Ala Ile Leu Pro Tyr Thr Lys Lys Ile Ile Val Leu Asp Asp
195 200 205

GGG GAA ATA GCG GAC CTG ACT CCC GAC ACT GTC AAC ATT TAC AAC TTT 672
Gly Glu Ile Ala Asp Leu Thr Pro Asp Thr Val Asn Ile Tyr Asn Phe
210 215 220

GAG GGA GAG CCC GTT TCA AAG GAA GTA ATG ATT ACG CCC TGG GAT CTT 720
Glu Gly Glu Pro Val Ser Lys Glu Val Met Ile Thr Pro Trp Asp Leu
225 230 235 240

GTT TCT GCG GAA AAG GGT GGT TTT AAA CAC TTC ATG CTA AAA GAG ATA 768
Val Ser Ala Glu Lys Gly Gly Phe Lys His Phe Met Leu Lys Glu Ile
245 250 255

TAC GAA CAG CCC AAA GCC ATA AAC GAC ACA CTC AAG GGT TTC CTC TCA 816
Tyr Glu Gln Pro Lys Ala Ile Asn Asp Thr Leu Lys Gly Phe Leu Ser
260 265 270

ACC GAA GAC GCA ATA CCC TTT AAG TTA AAA GAC TTC AGA AGG GTT TTA 864
Thr Glu Asp Ala Ile Pro Phe Lys Leu Lys Asp Phe Arg Arg Val Leu
275 280 285

ATA ATA GCG TGC GGG ACC TCT TAC CAC GCG GGC TTC GTC GGA AAG TAC 912
Ile Ile Ala Cys Gly Thr Ser Tyr His Ala Gly Phe Val Gly Lys Tyr
290 295 300

TGG ATA GAG AGA TTT GCA GGT GTT CCC ACA GAG GTA ATT TAC GCT TCG 960
Trp Ile Glu Arg Phe Ala Gly Val Pro Thr Glu Val Ile Tyr Ala Ser
305 310 315 320

GAA TTC AGG TAT GCG GAC GTT CCC GTT TCG GAC AAG GAT ATC GTT ATC 1008
Glu Phe Arg Tyr Ala Asp Val Pro Val Ser Asp Lys Asp Ile Val Ile
325 330 335

GGA ATT TCC CAG TCA GGA GAG ACC GCT GAC ACA AAG TTT GCC CTT CAG 1056
Gly Ile Ser Gln Ser Gly Glu Thr Ala Asp Thr Lys Phe Ala Leu Gln
340 345 350

TCC GCA AAG GAA AAG GGA GCC TTT ACC GTG GGA CTC GTA AAC GTA GTG 1104
Ser Ala Lys Glu Lys Gly Ala Phe Thr Val Gly Leu Val Asn Val Val
355 360 365

GGA AGT GCC ATA GAC AGG GAG TCG GAC TTT TCC CTT CAC ACA CAT GCG 1152
Gly Ser Ala Ile Asp Arg Glu Ser Asp Phe Ser Leu His Thr His Ala
370 375 380

GGA CCC GAA ATA GGC GTG GCG GCT ACA AAG ACC TTC ACC GCA CAG TTC 1200
Gly Pro Glu Ile Gly Val Ala Ala Thr Lys Thr Phe Thr Ala Gln Phe
385 390 395 400

ACC GCA CTC TAC GCC CTT TCG GTA AGG GAA AGT GAG GAG AGG GAA AAT 1248
Thr Ala Leu Tyr Ala Leu Ser Val Arg Glu Ser Glu Glu Arg Glu Asn
405 410 415

CTA ATA AGA CTC CTT GAA AAG GTT CCA TCA CTC GTT GAA CAA ACA CTG 1296
Leu Ile Arg Leu Leu Glu Lys Val Pro Ser Leu Val Glu Gln Thr Leu
420 425 430

AAC ACC GCA GAA GAA GTG GAG AAG GTA GCG GAA AAG TAC ATG AAA AAG 1344
Asn Thr Ala Glu Glu Val Glu Lys Val Ala Glu Lys Tyr Met Lys Lys
435 440 445

AAA AAC ATG CTT TAC CTC GGA AGG TAC TTA AAT TAC CCC ATA GCG CTG 1392
Lys Asn Met Leu Tyr Leu Gly Arg Tyr Leu Asn Tyr Pro Ile Ala Leu
450 455 460

GAG GGA GCT CTT AAA CTT AAA GAA ATT TCT TAC ATA CAC GCG GAA GGT 1440
Glu Gly Ala Leu Lys Leu Lys Glu Ile Ser Tyr Ile His Ala Glu Gly
465 470 475 480

TAT CCC GCA GGG GAG ATG AAG CAC GGT CCC ATA GCC CTC ATA GAC GAA 1488
Tyr Pro Ala Gly Glu Met Lys His Gly Pro Ile Ala Leu Ile Asp Glu
485 490 495

AAC ATG CCG GTT GTG GTA ATC GCA CCG AAA GAC AGG GTT TAC GAG AAG 1536
Asn Met Pro Val Val Val Ile Ala Pro Lys Asp Arg Val Tyr Glu Lys
500 505 510

ATA CTC TCA AAC GTA GAA GAG GTT CTC GCA AGA AAG GGA AGG GTT ATT 1584
Ile Leu Ser Asn Val Glu Glu Val Leu Ala Arg Lys Gly Arg Val Ile
515 520 525

TCT GTA GGC TTT AAA GGA GAC GAA ACT CTC AAA AGC AAA TCC GAG AGC 1632
Ser Val Gly Phe Lys Gly Asp Glu Thr Leu Lys Ser Lys Ser Glu Ser
530 535 540

GTT ATG GAA ATC CCG AAG GCA GAA GAA CCG ATA ACT CCT TTC TTG ACG 1680
Val Met Glu Ile Pro Lys Ala Glu Glu Pro Ile Thr Pro Phe Leu Thr
545 550 555 560

GTA ATA CCC CTG CAA CTC TTT GCC TAC TTT ATA GCG AGC AAA CTG GGA 1728
Val Ile Pro Leu Gln Leu Phe Ala Tyr Phe Ile Ala Ser Lys Leu Gly
565 570 575



580 



ssssssasssESSsssK 1776 

580 bab 

1779 



TAA 
End 



(2) INFORMATION FOR SEQ ID NO: 23: 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: GENOMIC DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

™m a ppt TAC AAG ACG GAG GTC 48 

S S S 32 gSJSS s as S 3 & - «. ~ 



35 40 

i-ssssssssssssss 

ATA CCC GTT TAC ATA COT GTT CCC ACC T£ CCC jg TAC « « J« 

He Pro Val Tyr He Pro Val Pro Tnr v ^ 
100 1U!3 

~ *7vr« prrtm P A A CTG GAC GAA AAC 

GCG AAA GTT CTC GGA AGA CCC CTC GTA AAG GTT CAA ^ ^ ^ ^ 
Ala Lys Val Leu Gly Arg Pro Leu vai ^y 125 
115 

„ gat ata «c™~j«sssi2SS55:s;5: 

Phe Asp He Asp Leu Glu Arg ser J-xe ^ 
130 

145 150 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 






175 

170 

165 



m ATR OAC «J « - « « £ E S K S S S £ 

Val He Asp Glu Ala iyr * 1Q5 
180 

195 

gsss=sS ssss S s55S 

210 m „ prr TTC aac GTG ACC TAC 

£ SSSS E £ ?S £ S « - - ^ ~ 3S 

ssasaaasaaaaaaaa 
-sssaaaassaaaaaas 

260 

mm pap PTT TTT CCG AGT AAG GCT 

SKaaaaassas-s"— 

275 

290 _. r TTA TCT TAC ATG GAA 

" £ 1 ~ s,t ^ Met - 

Glu Leu Leu Lys Arg Asp va 315 

305 „™ oar AAA CCG GAA GAA AAC 

ssas a a a - = s s e - G1 " a Mn 
s a a = a s a a a a a a a a a a 



TCT CTT TAA 
Ser Leu End 



(2) INFORMATION FOR SEQ ID NO: 24:

SSSiJ5SE c SS- 

(D) TOPOLOGY: LINEAR 
(ii) MOLECULE TYPE: GENOMIC DNA

ATG AAG CCG TAC GCT AAA TAT Aa 



48 



„ et ,ys Pro Tyr W. ^ *P « - °» «» ST ^ 

5 

™ ™r PTG TTG ACT CAC GCG CTT CAC TAC GGA 

SSS«S»SSSS2« H i3 ~ hu o t»t Gly 

SSSS2SSSSS52SSSSS 
is2S£2SSJSSSSS5ES 

85 

■ sasssssssssssa" 

100 

115 "° 

mm **r rra ACG ATT GTA AGC TGG CGT AGA 
TAC CTC TCG CCC AAC GGC ATT AAG GCA ACG ATT ^ ^ ^ ^ Arg 
Tyr Leu Ser Pro Asn Gly lie uy^ 14Q 

145 150 

„ MC tct « s s s sssssssss « 

Val Asn Ser Val Leu Ala l«u v«u. ^ 175 

„™ aar «TT TAT GTT GTT GAG GGT TCT GGA 

S S S St S « " S£ S S - v.x ox. S v « 

180 

195 2UU 

„ un Arir , OAT ACG GTA ATA AAG CTC 

CAC GAA TCT ATC CTC GAG GGA ATT ACG AGG GAT ACG ^ ^ ^ ^ 
His Glu Ser He Leu Olu Giy 22Q 
210 

SS5SS2SSSSSSSS5S 

225 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



48 



£ S E SK SS SS 32 ~ S S A ^ S 25 S S E 

260 265 

rrr GCC CCC ATT ACG ACA AAA ATA GCT GAG CTG TAC TCA AAC GTC GTG
Pro Ala Pro Ile Thr Thr Lys Ile Ala Glu Leu Tyr Ser Asn Val Val

275 280
AGA GGC AAA GTA GAG AAA TAC TTA AAT TGG ATC ACT CCT GTG TAT TAG
Arg Gly Lys Val Glu Lys Tyr Leu Asn Trp Ile Thr Pro Val Tyr End



816 



864 



912 



290 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 414 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Met Ile Glu Asp Pro Met Asp Trp Ala Phe Pro Arg Ile Lys Arg Leu
5 10

Pro Gln Tyr Val Phe Ser Leu Val Asn Glu Leu Lys Tyr Lys Leu Arg
20 25

Arg Glu Gly Glu Asp Val Val Asp Leu Gly Met Gly Asn Pro Asn Met

Pro Pro Ala Lys His Ile Ile Asp Lys Leu Cys Glu Val Ala Gln Lys
50 55

Pro Asn Val His Gly Tyr Ser Ala Ser Arg Gly Ile Pro Arg Leu Arg
65 70

Lys Ala Ile Cys Asn Phe Tyr Glu Glu Arg Tyr Gly Val Lys Leu Asp
85 90

Pro Glu Arg Glu Ala Ile Leu Thr Ile Gly Ala Lys Glu Gly Tyr Ser
100 105

His Leu Met Leu Ala Met Ile Ser Pro Gly Asp Thr Val Ile Val Pro
115 120

Asn Pro Thr Tyr Pro Ile His Tyr Tyr Ala Pro Ile Ile Ala Gly Gly
130 135

Glu Val His Ser Ile Pro Leu Asn Phe Ser Asp Asp Gln Asp His Gln
145 150

Glu Glu Phe Leu Arg Arg Leu Tyr Glu Ile Val Lys Thr Ala Met Pro
165 170

Lys Pro Lys Ala Val Val Ile Ser Phe Pro His Asn Pro Thr Thr Ile
180 185

Thr Val Glu Lys Asp Phe Phe Lys Glu Ile Val Lys Phe Ala Lys Glu
195 200

His Gly Leu Trp Ile Ile His Asp Phe Ala Tyr Ala Asp Ile Ala Phe
210 215

Asp Gly Tyr Lys Pro Pro Ser Ile Leu Glu Ile Glu Gly Ala Lys Asp
225 230

Val Ala Val Glu Leu Tyr Ser Met Ser Lys Gly Phe Ser Met Ala Gly
245

Trp Arg Val Ala Phe Val Val Gly Asn Glu Ile Leu Ile Lys Asn Leu
260

Ala His Leu Lys Ser Tyr Leu Asp Tyr Gly Ile Phe Thr Pro Ile Gln
275 280

Val Ala Ser Ile Ile Ala Leu Glu Ser Pro Tyr Glu Ile Val Glu Lys
290

Thr Ala Lys Val Tyr Gln Lys Arg Arg Asp Val Leu Val Glu Gly Leu

Asn Arg Leu Gly Trp Lys Val Lys Lys Pro Lys Ala Thr Met Phe Val

Trp Ala Lys Ile Pro Glu Trp Ile Asn Met Asn Ser Leu Asp Phe Ser

340 

„ Phe Leu Leu Lys Glu Al. Lys val Al, val ser Pro Gly val Gly 

355 360 

Phe Gly Gln Tyr Gly Glu Gly Tyr Val Arg Phe Ala Leu Val Glu Asn

370 

01 » His Arg He Arg GLn Ai. He Arg Gly lie Arg Lys Al. Phe Arg 



385 "0 



,y. Lev. ■•,» Glu at, Lys_Leu Giu Pro Glu Arg ser «j_-d 



(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 373 AMINO ACIDS 



(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: PROTEIN 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
Met Asp Arg Leu Glu Lys Val Ser Pro Phe Ile Val Met Asp Ile Leu

Ala Gln Ala Gln Lys Tyr Glu Asp Val Val His Met Glu Ile Gly Glu
Pro Asp Leu Glu Pro Ser Pro Lys Val Met Glu Ala Leu Glu Arg Ala 
val Lys I Lys Thr Phe Phe Tyr Thr Pro Ala Leu Gly Leu Trp Glu 



Lys 
50 



55 



Mu *rg Glu Arg He Ser Glu Phe Tyr Arg Ly. Lys Tyr Ser val Giu 

50 



nc 80 

65 70 75 

Val Ser Pro Glu Arg Val Ile Val Thr Thr Gly Thr Ser Gly Ala Phe

85 90

Leu Val Ala Tyr Ala Val Thr Leu Asn Ala Gly Glu Lys Ile Ile Leu

100 105



Pro Asp Pro Ser Tyr Pro Cys Tyr Lys Asn Phe Ala Tyr Leu Leu Asp 

115 120
Ala Gln Pro Val Phe Val Asn Val Asp Lys Glu Thr Asn Tyr Glu Val

130 135
Arg Lys Glu Met Ile Glu Asp Ile Asp Ala g. Ala Leu His Ile Ser
145 150

Ser Pro Gln Asn Pro Thr Gly Thr Leu Tyr Ser Pro Glu Thr Leu Lys

165 170

Glu Leu Ala Glu Tyr Cys Glu Glu Lys Gly Met Tyr Phe Ile Ser Asp

180 185
Glu Ile Tyr His Gly Leu Val Tyr Glu Gly Arg Glu His Thr Ala Leu

Glu Phe Ser Asp Arg Ala Ile Val Ile Asn Gly Phe Ser Lys Tyr Phe

210 215 



Cys Met Pro Gly Phe Arg Ile Gly Trp Met Ile Val Pro Glu Glu Leu

225 230

Val Arg Lys Ala Glu Ile Val Ile Gln Asn Val Phe Ile Ser Ala Pro

Leu Ser Gln Tyr Ala Ala Leu Glu Ala Phe Asp Tyr Glu Tyr Leu

260 265

Glu Lys Val Arg Lys Thr Phe Glu Glu Arg Arg Asn Phe Leu Tyr Gly 

275 280 
Glu Leu Lys Lys Leu Phe Ly. lie Asp Al. Lys Pro Gl» Gly Al, Phe 

290 2yb 
T^r val Trp Ala Asn He Ser Asp. Tyr Ser Thr Asp Ser Tyr Glu Phe 
305 310 

Ala Leu Lys Leu Leu Arg Glu Ala Arg Val Ala Val Thr Pro Gly Val

325 330

Asp Phe Gly Lys Asn Lys Thr Lys Glu Tyr Ile Arg Phe Ala Tyr Thr

340 345
Lys Ile Glu Glu Leu Lys Glu Gly Val Glu Arg Ile Lys Lys Phe



355 360 



Leu Glu Lys Leu Ser 
370 






(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 453 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Met Trp Glu Leu Asp Pro Lys Thr Leu Glu Lys Trp Asp Lys Glu Tyr

Phe Trp His Pro Phe Thr Gln Met Lys Val Tyr Arg Glu Glu Glu Asn
20

Leu Ile Phe Glu Arg Gly Glu Gly Val Tyr Leu Trp Asp Ile Tyr Gly
35 40

Arg Lys Tyr Ile Asp Ala Ile Ser Ser Leu Trp Cys Asn Val His Gly
50 55

His Asn His Pro Lys Leu Asn Asn Ala Val Met Lys Gln Leu Cys Lys
65 70

Val Ala His Thr Thr Thr Leu Gly Ser Ser Asn Val Pro Ala Ile Leu
85 90

Leu Ala Lys Lys Leu Val Glu Ile Ser Pro Glu Gly Leu Asn Lys Val
100 105

Phe Tyr Ser Glu Asp Gly Ala Glu Ala Val Glu Ile Ala Ile Lys Met
115 120

Ala Tyr His Tyr Trp Lys Asn Lys Gly Val Lys Gly Lys Asn Val Phe
130 135

Ile Thr Leu Ser Glu Ala Tyr His Gly Asp Thr Val Gly Ala Val Ser
145 150

Val Gly Gly Ile Glu Leu Phe His Gly Thr Tyr Lys Asp Leu Leu Phe
165

Lys Thr Ile Lys Leu Pro Ser Pro Tyr Leu Tyr Cys Lys Glu Lys Tyr
180 185

Gly Glu Leu Cys Pro Glu Cys Thr Ala Asp Leu Leu Lys Gln Leu Glu
195 200

Asp Ile Leu Lys Ser Arg Glu Asp Ile Val Ala Val Ile Met Glu Ala
220

Gly Ile Gln Ala Ala Ala Gly Met Leu Pro Phe Pro Pro Gly Phe Leu
225 230

Lys Gly Val Arg Glu Leu Thr Lys Lys Tyr Asp Thr Leu Met Ile Val

Asp Glu Val Ala Thr Gly Phe Gly Arg Thr Gly Thr Met Phe Tyr Cys
260 265



Glu Gln Glu Gly Val Ser Pro Asp Phe Met Cys Leu Gly Lys Gly Ile
275

Thr Gly Gly Tyr Leu Pro Leu Ala Ala Thr Leu Thr Thr Asp Glu Val
290 295

Phe Asn Ala Phe Leu Gly Glu Phe Gly Glu Ala Lys His Phe Tyr His

Gly His Thr Tyr Thr Gly Asn Asn Leu Ala Cys Ser Val Ala Leu Ala
325

Asn Leu Glu Val Phe Glu Glu Glu Arg Thr Leu Glu Lys Leu Gln Pro
340

Lys Ile Lys Leu Leu Lys Glu Arg Leu Gln Glu Phe Trp Glu Leu Lys

His Val Gly Asp Val Arg Gln Leu Gly Phe Met Ala Gly Ile Glu Leu
370

Val Lys Asp Lys Glu Lys Gly Glu Pro Phe Pro Tyr Gly Glu Arg Thr

Gly Phe Lys Val Ala Tyr Lys Cys Arg Glu Lys Gly Val Phe Leu Arg

Pro Leu Gly Asp Val Met Val Leu Met Met Pro Leu Val Ile Glu Glu
420

Mp G1 u « » ^ - XX, Jg -r ~ «, JJ. XXe W - 

435 

Leu Glu Lys Glu Val 
450 



(2) INFORMATION FOR SEQ ID NO: 28 : 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii ) MOLECULE TYPE: PROTEIN 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Met Thr Tyr Leu Met Asn Asn Tyr Ala Arg Leu Pro Val Lys Phe Val

5

Arg Gly Lys Gly Val Tyr Leu Tyr Asp Glu Glu Gly Lys Glu Tyr Leu
20 

up , he « S « «, xxe <u y vax » ~ - -y -1 - ~ - 

Lys wu I OX. AXa - Ts «- - VaX 0X U Ly s _ - His val 



50 

Ser Asn Leu 



Olu » «• «P «» sl » «? — " 1S Ll " " 



70 
65 



W Hi. - «g ~ «" ^ *S ~ " S - 

^ Glu s« « 2 ». - - - ™ - 35 " 3 

100 

Asp Lys Gly Lys Asn Lys Trp Lys Phe Ile Ser Phe Glu Asn Ser Phe

115

His Gly Arg Thr Tyr Gly Ser Leu Ser Ala Thr Gly Gln Pro Lys Phe

130 135 

His Lys «, « «u » - V., « «, g. se, ^ - «■ - 

145 



., n. M p JJ. v.! Tyr L». «- - ^ «» «» " 

Ile n. *. «» - - - S! 01y ° 1Y val " - " ~ 

180 

rlT1 rlu ne Cys Lys Glu Lys Asp Val 
Tm Ser Lys Leu Gin Glu xJ-e ^y=» jt 
Glu Asp Phe Leu bet uy=> 
195 

MU „ xx. n. « - v S -» * «* n. «j «. - - 

210 ^ X 

„ - - £■ - » - ws S MP val " e A " ~ 

225 

u , Lys Gly » «J «v - » .JJ «, ~ n. » - ™ 

24b 

oh Thr- Pro Gly Ser His Gly Ser Thr Phe 
Olu Glu Val Ala Gin Ser Phe Thr Pro Gly 270 

260 

" Gly ^ ™ - ~ v s -«^-r™*- 

« «, - - « - - - ° la Vil S! " B ^ *" 

«. Z - - «» - Gly ws 01y ws - ws " " ffi 
,- s -«» -.-g. — — 

32b 

Ala Leu Glu Arg Asp Phe Ser 
340 



(B) TYPE: AMINO ACID
(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: PROTEIN

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:
„ et Arg Lys »u Olu Arg Al. olu ,ys « Eer P» Ser « «- 

Leu ser v.! asp ~ w «£ "» «» 01 ° "S G1 ° M9 

20 ZO 
Val Ile Asn Phe Gly Ala Gly Glu Pro Asp Phe Asp Thr Pro Glu His

Ile Lys Glu Ala Ala Lys Arg Ala Leu Asp Gln Gly Phe Thr Lys Tyr

35 40

50 55



^ pro v,! Al. oly u« «- «° « «S " e 



65 70 



« Tyr Arg Asp As» olu - «- ^ "J "° " " e ^ 

8 5 



ser cys Oly Ala Lys HI. ser He Phe A»n Ala «, «n val leu u» 

100 lu::> 
^ Pro oly MP «- V,X He I- -.v.! Pro Tyr Trp Thr ~ Tyr 

K o olu 2 vai ,y - Al, oly Oly val Pro Val Phe val Pro Thr 



115 

Val Lys Leu ^ 

s „ Z «- » -p - Sl «• « »■ M ° SS ^ " u w "* "° 

145 lb 

val Thr P,o Arg Arg - «- ^ » ~ "» t?S "° 

5ly W V«-™ Arg-olu-Olu ^ S- G - 1 -- V -^- 



u , _ Olu Al. Asp Leu Trp lie - Ser Asp olu He Tyr Olu 

195 ^ UU 
^ u H. Tyr Asp Oly Me, Olu His Val ser He Ala Al, « Asp Pro 



210 215 



g; val Lys ,ys Arg ». - -I V,l Asu Oly Val Ser ,ys Ala Tyr 

if, Met Thr Oly Trp Arg He Oly Tyr Ala Ala Ala Pro Arg Pro II. 

245 

,1, olu Ala Met Thr Asn Leu olu Eer His Ee, Thr E.r » Pro Thr 
Eer V,l Ala I Ala Ala Ala Leu Al. Ala Leu Lys Oly Pro olu olu 

275 Z0U 



V.1 Olu AS. Met Arg Arg Ala Phe Olu Lys Arg Arg Asp Phe He 

290 295 

Trp Gln Tyr Leu Asn Ser Leu Pro Gly Val Arg Cys Pro Lys Pro Leu

305 310 320



Gly Ala Phe Tyr Val Phe Pro Glu Val Glu Arg Ala Phe Gly Pro Pro

325 330

Ser Lys Arg Thr Gly Asn T*r Thr Ala Ser Asp Leu Ala Leu Phe Leu 

Leu Glu Glu Ile Lys Val Ala Thr Val Ala Gly Ala Ala Phe Gly Asp

355 360
Asp Arg Tyr Leu Arg Phe Ser Tyr Ala Leu Arg Leu Glu Asp Ile Glu



355 

Leu Arg Phe 
370 375 

Glu Gly Met Gln Arg Phe Lys Glu Leu Ile Glu Ala Ala Leu
385 390 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 592 AMINO ACIDS 

(B) TYPE: AMINO ACID 
(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 



Met Cys Gly Ile Val Gly Tyr Val Gly Arg Asp Leu Ala Leu Pro Ile

Val Leu Gly Ala Leu Glu Arg Leu Glu Tyr Arg Gly Tyr Asp Ser Ala

20 25
Gly Val Ala Leu Ile Glu Asp Gly Lys Leu Ile Val Glu Lys Lys Lys

35 40
Gly Lys Ile Arg Glu Leu Val Lys Ala Leu Trp Gly Lys Asp Tyr Lys

50 55

Ala Lys Thr Gly Ile Gly His Thr Arg Trp Ala Thr His Gly Lys Pro

«,r Asp Glu Asn Ala His Pro His Thr Asp Glu Lys Gly Glu Phe Ala 

85 90

Val Val His Asn Gly Ile Ile Glu Asn Tyr Leu Glu Leu Lys Glu Glu

100 105

Leu Lys Lys Glu Gly Val Lys Phe Arg Ser Glu Thr Asp Thr Glu Val

115 120
Ile Ala His Leu Ile Ala Lys Asn Tyr Arg Gly Asp Leu Leu Glu Ala

130 135
Val Leu Lys Thr Val Lys Lys Leu Lys Gly Ala Phe Ala Phe Ala Val

145 150

Ile Thr Val His Glu Pro Asn Arg Leu Ile Gly Val Lys Gln Gly Ser

165 170

Pro Leu Ile Val Gly Leu Gly Glu Gly Glu Asn Phe Leu Ala Ser Asp



180 185 



■ U Pro Al. He Leu Pro Tyr Thr Lys Lys II. He v.l Leu MP Asp 

195 200
Gly Glu Ile Ala Asp Leu Thr Pro Asp Thr Val Asn Ile Tyr Asn Phe

210 215
Glu Gly Glu Pro Val Ser Lys Glu Val Met Ile Thr Pro Trp Asp Leu

225 230

Val Ser Ala Glu Lys Gly Gly Phe Lys His Phe Met Leu Lys Glu Ile

245 

^ olu Gin Pro Lys Ala He Asn Asp Thr Leu Lys Gly Phe Leu Ser 

J 260 

* Olu Asp Al. lie Pro Phe Lys ». Lys «P - Jg «» 

275 280 
Ile Ile Ala Cys Gly Thr Ser Tyr His Ala Gly Phe Val Gly Lys Tyr

290 295
Trp Ile Glu Arg Phe Ala Gly Val Pro Thr Glu Val Ile Tyr Ala Ser

305 310

Glu Phe Arg Tyr Ala Asp Val Pro Val Ser Asp Lys Asp Ile Val Ile

325 330
Gly Ile Ser Gln Ser Gly Glu Thr Ala Asp Thr Lys Phe Ala Leu Gln

340

Ser Ala Lys Glu Lys Gly Ala Phe Thr Val Gly Leu Val Asn Val Val

355 360



Qpr Asr> Phe Ser Leu His Thr His Ala 
Gly Ser Ala He Asp Arg Glu Ser Asp Pne » ^ 

3" 70 , \1 S = Thr- lvs Thr Phe Thr Ala Gin Phe 

Gly Pro Glu He Gly Val Ala Ala Thr Lys Tnr 4Q0 

385 ~ 390 _ ~~ 

, t c Dr val Arcj Glu Ser Glu Glu Arg Glu Asn 
Thr Ala Leu Tyr Ala Leu Ser Val Arg exu 415 



^ He Arg Leu Leu olu Lys val Pro Ser Leu v.l Olu oln Thr Leu 

As, Thr Ala Til Olu V.1 Olu Lys v.! Ala Olu Lys Tyr Met Lys Lys 

435 



1M rlv Aj-q Tyr Leu Asn Tyr Pro He Ala Leu 
Lys Asn Met Leu Tyr Leu Gly Arg lyr ^ Q 

450 * ss 

^-i n= cor tvt He His Ala Glu Gly 
Glu Gly Ala Leu Lys Leu Lys Glu He Ser Tyr 48Q 

465 47 

Tyr Pro Ala Gly Glu Met Lys His Gly Pro Ile Ala Leu Ile Asp Glu

485

Asn Met Pro Val Val Val Ile Ala Pro Lys Asp Arg Val Tyr Glu Lys

500 505

» v»i rlu Glu Val Leu Ala Arg Lys Gly Arg Val He 
He Leu Ser Asn Val Glu giu vai ^ 525 

515 520 



ser v.l sly P»e «Y » «»- — — !S WS "* ° 1U S " 



530 " 5 



Val ««t Glu II. Pro ». Ma Glu «» « g. «r Pro Phe « 560 

545 550 

Val P« I*u Gl„ l*u ». «y * "« ^ SS ^ 

Mu ^ Va! Asp I Pro Arg Asu £u Al, Lys ». Val Thr v.l 31. 

580 



(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 354 AMINO ACIDS

(B) TYPE: AMINO ACID
(D) TOPOLOGY: LINEAR



(ii) MOLECULE TYPE: PROTEIN 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Ile Pro Gln Arg Ile Lys Glu Leu Glu Ala Tyr Lys Thr Glu Val
ttr Pro Ala Ser Val Arg Leu Ser Ser Asn Glu Phe Pro Tyr Asp Phe 

Pro Glu Glu Ile Lys Gln Arg Ala Leu Glu Glu Leu Lys Lys Val Pro
35 40

Leu Asn Lys Tyr Pro Asp Pro Glu Ala Lys Glu Leu Lys Ala Val Leu
f a Asp Phe Phe Gly Val Lys Glu Glu Asn Leu Val Leu Gly Asn Gly 
l Asp Glu Leu lie Tyr Tyr Leu Ser lie Ala He Gly Glu Leu Tyr 

Ile Pro Val Tyr He P~ Val Pro Thr Phe Pro Met Tyr Glu He Ser 

100 xuo 
u . Lys val Leu Oly Arg Pro Leu val Lys va! Glu Leu A.P «- - 

115 

Phe Asp Ile Asp Leu Glu Arg Ser Ile Glu Leu Ile Glu Lys Glu Lys

130 135



„ val Leu Oly Tyr » Al, Tyr Pro Asu Asn Pro Thr oly Asn Leu 

145 150 

P he s.r Arg «, Lys He Glu Oiu xle Arg A.u Arg Gly val Phe cys 

165 

Val Ile Asp Glu Ala Tyr Tyr His Tyr Ser Gly Glu Thr Phe Leu Glu

180 185

Asp Ala Leu Lys Arg Glu Asp Thr Val Val Leu Arg Thr Leu Ser Lys

195 200



Ile Gly Met Ala Ser Leu Arg Val Gly Ile Leu Ile Gly Lys Gly Glu

210 215
lie Val Ser Glu IU Ash Lys V,! Arg Leu go Phe Asn val Thr Tyr 

HI ser «. val Met Ala Lys val > J- «r «. Gly "3 «« 

245 ^ su 

Leu Met Glu Lys Ile Gln Glu Val Val Thr Glu Arg Glu Arg Met Tyr

260 265

Asp Glu Met Lys Lys Ile Glu Gly Val Glu Val Phe Pro Ser Lys Ala

275 280

Asn Phe Leu Leu Phe Arg Thr Pro Tyr Pro Ala His Glu Val Tyr Gln

290 295 
Glu Leu Leu Lys Arg Asp Val Leu Val Arg Asn Val Ser Tyr Met Glu 

305 310 

Gly L eu Gl» ,ys «. Leu «. val ser Val Gly Lys Pro Glu «U — 

„„ I.y, Phe Leu Glu Al, Leu Glu Glu Ser II. Lys ser Leu Ser ser 

340 345 



Ser Leu 



(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 303 AMINO ACIDS

(B) TYPE: AMINO ACID
(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: PROTEIN

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

Met Lys Pro Tyr Ala Lys Tyr Ile Trp Leu Asp Gly Arg Ile Leu Lys

Trp Glu Asp Ala Lys Ile His Val Leu Thr His Ala Leu His Tyr Gly
20

Thr Ser Ile Phe Glu Gly Ile Arg Gly Tyr Trp Asn Gly Asp Asn Leu

35

Leu Val Phe Arg Leu Glu Glu His Ile Asp Arg Met Tyr Arg Ser Ala

50 55 
L ys lie Leu Gly lie -J He Pro Tyr Thr Arg Glu Glu val Arg Gl» 

2 val Leu Glu Thr He Lys Ma Ash Ash Ph. Arg Glu Asp Val Tyr 

85 

Ile Arg Pro Val Ala Phe Val Ala Ser Gln Thr Val Thr Leu Asp Ile

100 105



^ Asn Leu Olu Val Ser Leu Ala Val He Val Phe Pro Phe Gly Lys 



115 120 



^ L eu Ser Pro Asn Gly lie Lys Ala Thr He Val Ser Trp Arg Arg 



130 



Val His Asn Thr Met Leu Pro Val Met Ala Lys Ile Gly Gly Ile Tyr

145 150

Val Asn Ser Val Leu Ala Leu Val Glu Ala Arg Ser Arg Gly Phe Asp

165 170

Glu Ala Leu Leu Met Asp Val Asn Gly Tyr Val Val Glu Gly Ser Gly

180 185
«» *. He Phe II. val «, Gly Oly Ar g Leu Ph. Thr Pro Pro v,l 

195 2UU 
„i. <Uu ser He Leu Olu Oly H. Thr ^ Asp Thr V.l II. Lye Leu 

210 215 
Ser Gly Asp Val Gly Leu Arg Val Glu Glu Lys Pro Ile Thr Arg Glu
225 230

Glu Val Tyr Thr Ala Asp Glu Val Phe Leu Val Gly Thr Ala Ala Glu

245

Ile Thr Pro Val Val Glu Val Asp Gly Arg Thr Ile Gly Thr Gly Lys

260

Pr „ Oly Pro lie Thr Thr Lys II. «. Olu Leu Tyr S.r _ v.l V,l 

275 280 
Gly Lys Val Glu Lys Tyr Leu Asn Trp Ile Thr Pro Val Tyr



Arg 2 95 

290 A ^ 



(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 52 NUCLEOTIDES

(B) TYPE: NUCLEIC ACID

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGGCAGTC AAAGTGCGGC CT 

(2) INFORMATION FOR SEQ ID NO: 34:

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 



(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:
CGGAGGATCC TTATCCAAAG CTTCCAGGAA G 31



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 1,092 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: GENOMIC DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

SSSSESSKSESSSKE^S 

20 25 

S !S i| S S E 2£ SK 2£ S S S £ E S 55 

50 55 
85 

Met Leu Ser Leu Ala Leu Leu Ala Pro my 11Q 
100 lo;> 

SEESSS^SESS2S2SSSS 

115 120 

CE3E£ES5£S = ™ = *S 

145 150 

2SSS5S5S2SSSSSSE5 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 
195 200

»«fS^SSS?^SSS S S 5? S 

Leu Arg Arg Gly Tyr Asn vai 22Q 
210 ^ 

ssssssssssfssgss 

WSESSSSSS2SSSS 2 

Ala Ala Gin He Ala Ala Leu Ala Axa w 2?<J 
260 Zbs> 

ococrrxco coc cag gtc aac gaa gaa ggg aag gtt ttt ctc tac cga 
Ala Leu Ser Arg Gln Val Asn Glu Glu Gly Lys Val Phe Leu Tyr Arg
275 280 

s ssss s « s 

Glu Leu Glu Arg Arg Gly lie Ala Tyr va ^ 
290 ^ 

S2S5SS55S2SSSSSS 

s s a s; 1 s s s ssj s s s s I - 

340 J4b 

OT „ a , MS «. or* «o en j» ™ «J " 

Ala Leu Asp Lys Ala Leu Glu Leu Arg y ^ 
355 JbU 



576 



624 



672 



720 



816 



864 



912 



960 



1056 



(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: AMINO ACIDS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
Met Ala Val Lys Val Arg Pro Glu Leu Ser Gln Val Glu Ile Tyr Arg
Pro Gly Lys Pro Ile Glu Glu Val Lys Lys Glu Leu Gly Leu Glu Glu

20 25 30

Val Val Lys Leu Ala Ser Asn Glu Asn Pro Leu Gly Pro Ser Pro Lys

Ala Val Ala Ala Leu Glu Gly Leu Asp His Trp His Leu Tyr Pro Glu

50 55
Gly Ser Ser Tyr Glu Leu Arg Gln Ala Leu Gly Lys Lys Leu Glu Ile

Hp Pro Asp Ser Ile Ile Val Gly Cys Gly Ser Ser Glu Val Ile Gln

Met Leu Ser Leu Ala Leu Leu Ala Pro Gly Asp Glu Val Val Ile Pro

100 105
Val Pro Thr Phe Pro Arg Tyr Glu Pro Leu Ala Arg Leu Met Gly Ala

115 120
Asn Pro Val Lys Val Pro Leu Lys Asp Tyr Arg Ile Asp Val Glu Ala

val Til Arg Ala Leu Ser Pro Arg Thr Lys Leu Val Tyr Leu Cys Asn 

145 150 

Pro Asn Asn Pro Thr Gly Thr Ile Val Thr Arg Glu Glu Val Glu Trp

165

Phe Leu Glu Lys Ala Gly Glu Gly Val Leu Thr Val Leu Asp Glu Ala

180 185 190
Tyr Cys Glu Tyr Val Thr Ser Pro Ala Tyr Pro Asp Gly Leu Asp Phe

195 200

Leu Arg Arg Gly Tyr Asn Val Val Val Leu Arg Thr Phe Ser Lys Ile

210 215 220
Tyr Gly Leu Ala Gly Leu Arg Ile Gly Tyr Gly Val Ala Asp Arg Glu

225 230

Leu Val Ala Glu Leu His Arg Val Arg Glu Pro Phe Asn Val Ser Ser

245 250
Ala Ala Gln Ile Ala Ala Leu Ala Ala Leu Glu Asp Glu Glu Phe Val

260

Ala Leu Ser Arg Gln Val Asn Glu Glu Gly Lys Val Phe Leu Tyr Arg

275 280

Glu Leu Glu Arg Arg Gly Ile Ala Tyr Val Pro Thr Glu Ala Asn Phe

290

Leu Leu Phe Asp Ala Gly Arg Asp Glu Gln Glu Val Phe Arg Arg Met

305 310 

^ Arg Gin Gly Val Xle He Arg X*x Gly Val Gly Tyr Pro Thr H,s 

,e U Arg Val Thr 2 Gly Thr Leu Glu Gin Asn Gin Arg Phe Leu G!u 

340 34b 
Ala Leu Asp Lys Ala Leu Glu Leu Arg Gly Val 



355 



(2) INFORMATION FOR SEQ ID NO:37:

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 52 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:
CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGAGAAAA GGACTTGCAA GT 



(2) INFORMATION FOR SEQ ID NO:38: 



(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 31 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:
CGGAGGATCC TTAGATCTCT TCAAGGGCTT T 31 



(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 1,085 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 



ATG AGA AAA GGA CTT GCA AGT AGG GTA AGT CAC CTA AAA CCT TCC CCC 48 
Met Arg Lys Gly Leu Ala Ser Arg Val Ser His Leu Lys Pro Ser Pro 
5 10 15 

ACG CTG ACC ATA ACC GCA AAA GCA AAA GAA TTA AGG GCT AAA GGA GTG 96 
Thr Leu Thr Ile Thr Ala Lys Ala Lys Glu Leu Arg Ala Lys Gly Val
20 25 30

GAC GTT ATA GGT TTT GGA GCG GGA GAA CCT GAC TTC GAC ACA CCC GAC 144
Asp Val Ile Gly Phe Gly Ala Gly Glu Pro Asp Phe Asp Thr Pro Asp
35 40 45

TTC ATA AAG GAA GCC TGT ATA AGG GCT TTA AGG GAA GGA AAG ACC AAG 192
Phe Ile Lys Glu Ala Cys Ile Arg Ala Leu Arg Glu Gly Lys Thr Lys
50 55 60

TAC GCT CCC TCC GCG GGA ATA CCA GAG CTC AGA GAA GCT ATA GCT GAA 240
Tyr Ala Pro Ser Ala Gly Ile Pro Glu Leu Arg Glu Ala Ile Ala Glu
65 70 75 80

AAA CTA CTG AAA GAA AAC AAA GTT GAG TAC AAA CCT TCA GAG ATA GTC 288
Lys Leu Leu Lys Glu Asn Lys Val Glu Tyr Lys Pro Ser Glu Ile Val
85 90 95

GTT TCC GCA GGA GCG AAA ATG GTT CTC TTC CTC ATA TTC ATG GCT ATA 336
Val Ser Ala Gly Ala Lys Met Val Leu Phe Leu Ile Phe Met Ala Ile
100 105 110

CTG GAC GAA GGA GAC GAG GTT TTA CTA CCT AGC CCT TAC TGG GTA ACT 384
Leu Asp Glu Gly Asp Glu Val Leu Leu Pro Ser Pro Tyr Trp Val Thr
115 120 125

TAC CCC GAA CAG ATA AGG TTC TTC GGA GGG GTT CCC GTT GAG GTT CCT 432
Tyr Pro Glu Gln Ile Arg Phe Phe Gly Gly Val Pro Val Glu Val Pro
130 135 140

CTA AAG AAA GAG AAA GGA TTT CAA TTA AGT CTG GAA GAT GTG AAA GAA 480
Leu Lys Lys Glu Lys Gly Phe Gln Leu Ser Leu Glu Asp Val Lys Glu
145 150 155 160 



S2SSSS I S 55 S S S SS S 25 S 
2S2SSSSSS2SSSS5S 

2 25 SS S | S S S S 1? S 5 S S S K 



GCT TTG GAT AAG GCT CTA GAG CTT AGG GGG GTT TAA
Ala Leu Asp Lys Ala Leu Glu Leu Arg Gly Val

355 360



864 



912 



960 



1008 



1056 



1092 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: AMINO ACIDS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

Met Ala Val Lys Val Arg Pro Glu Leu Ser Gln Val Glu Ile Tyr Arg

Pro Gly Lys Pro Ile Glu Glu Val Lys Lys Glu Leu Gly Leu Glu Glu

Val Val Lys Leu Ala Ser Asn Glu Asn Pro Leu Gly Pro Ser Pro Lys

Ala Val Ala Ala Leu Glu Gly Leu Asp His Trp His Leu Tyr Pro Glu

50 55
Gly Ser Ser Tyr Glu Leu Arg Gln Ala Leu Gly Lys Lys Leu Glu Ile

Hp Pro Asp Ser Ile Ile Val Gly Cys Gly Ser Ser Glu Val Ile Gln

Met Leu Ser Leu Ala Leu Leu Ala Pro Gly Asp Glu Val Val Ile Pro



Val Pro Thr Phe Pro Arg Tyr Glu Pro Leu Ala Arg Leu Met Gly Ala

115 120
Asn Pro Val Lys Val Pro Leu Lys Asp Tyr Arg Ile Asp Val Glu Ala

Val Til Arg Ala Leu Ser Pro Arg Thr Lys Leu Val Tyr Leu Cys Asn

Pro Asn Asn Pro Thr Gly Thr Ile Val Thr Arg Glu Glu Val Glu Trp
Phe Leu Glu Lys Ala Gly Glu Gly Val Leu Thr Val Leu Asp Glu Ala

180

Tyr Cys Glu Tyr Val Thr Ser Pro Ala Tyr Pro Asp Gly Leu Asp Phe

195 200 205

Leu Arg Arg Gly Tyr Asn Val Val Val Leu Arg Thr Phe Ser Lys Ile

210 215 220

Tyr Gly Leu Ala Gly Leu Arg Ile Gly Tyr Gly Val Ala Asp Arg Glu

225 230 240

Leu Val Ala Glu Leu His Arg Val Arg Glu Pro Phe Asn Val Ser Ser
Ala Ala Gln Ile Ala Ala Leu Ala Ala Leu Glu Asp Glu Glu Phe Val
Ala Leu Ser Arg Gln Val Asn Glu Glu Gly Lys Val Phe Leu Tyr Arg

275 280
Glu Leu Glu Arg Arg Gly Ile Ala Tyr Val Pro Thr Glu Ala Asn Phe

Leu Leu Phe Asp Ala Gly Arg Asp Glu Gln Glu Val Phe Arg Arg Met

305 310

Leu Arg Gln Gly Val Ile Ile Arg X*x Gly Val Gly Tyr Pro Thr His

Leu Arg Val Thr Z Gly Thr Leu Glu Gln Asn Gln Arg Phe Leu Glu

340 345

Ala Leu Asp Lys Ala Leu Glu Leu Arg Gly Val

355 360

(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 52 NUCLEOTIDES 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGAGAAAA GGACTTGCAA GT 
52 

(2) INFORMATION FOR SEQ ID NO: 38:

(i) SEQUENCE CHARACTERISTICS
(A) LENGTH: 31 NUCLEOTIDES

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

CGGAGGATCC TTAGATCTCT TCAAGGGCTT T 

(2) INFORMATION FOR SEQ ID NO:39:

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 



35 
S S ^ SI S 5 S S S £ SS Sg 35 E - ST
^ S S S S E K - S 52 S S £ K S S« 

100 10S> 
CTG GAC GAA GGA GAC GAG GTT TTA CTA CCT AGC CCT TAC TGG GTA ACT 384
Leu Asp Glu Gly Asp Glu Val Leu Leu Pro Ser Pro Tyr Trp Val Thr
115 120 125

ssssaasssssssssr 

145 

- -I T £ 8£ 5 S » S S g S E E £ 55 2= 8 

TTT TGC GTG GAG ,00 000 ATT TTC AT, ATT TOO GAT GAG ™C TAT «4 

Phe Cys Val Glu Arg Oly He Phe He ia 2£)5 
195 

TAC TTC GTT TAC f « »» « « « £ S S S ~ 

Tyr Phe Val Tyr Gly Asp Ala Lys rae ^ 

210 

GAT GAA OTA AAG AAC ATA ACC TTC ACO OTA AAC OOC TTT TOG AAG AG*0 
Asp Glu Val Lys Asn lie Thr Pne in ^ 240 

225 

ssssgssBSiasssar 

„c «» •» ~ » «■ - ™ ~ "~ ~ *" ™ ™ 



a on <?er Gin Ser Val Ser Asn Val 
TT i ti^ iia Ser Leu Asn ber uj-" ■=> 
Tyr Ala Lys Val He Ala ber ^ 270 

ACT ACC TTT « CAO « - « « gS S S £ ^ ^ " ^ 

Thr Thr Phe Ala Gin ryr ^ 285 

305 ^ ™r TTC TCC GCT TAC 0.©D8 

™ «* T S* S £ Tyr S S - S L, AXa Tyr A!a 

Lys Pro Glu Gly Ala Phe iyr 33Q 

m~r. far TTC CTT CTG GAA AS06 

asssasssssssssssr 
assssssssassasss- 

370 

AGG AGA ATA AAG AAA GCC CTT GAA GAG ATC TAA 1185
Arg Arg Ile Lys Lys Ala Leu Glu Glu Ile
385 390



(2) INFORMATION FOR SEQ ID NO:40:

(B) TYPE: AMINO ACID
(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: polypeptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40:

Met Lys OXy Leu A!a Se, « « - - ~ * ™ ^ ~ 



2 0 

„ VaX XXe «Y - -V - 0X y - ~ A sp « *» »P 

phe Ile ^ G Xu Ma 0,3 xxe « - x~ « - cx y Ly s Tta 
v Ma « Ser Ma 0X y Ue Pro «0» - „ CX» Ma XI. ^ «g 
J s Leu « LyS «- - LY e VaX OXU ^ „ P™ Ser OXu XXe VaX 
val Ser Ma a ly I LyS VaX J- - - XX. - - - - 
x- „ cxu Z « - - g; ,eu P~ Ser « «P V- - 

115 

v Pro CX» OXn XXe « 5. - -V «T VaX - VaX CX» VaX Pro 

« ~ «» « - - - « - - - ~ & - 

Pro Thr 0X y Ma - ^ OXU G X» G X» - X- ^ XJ. Ma CXu 
Fhe VaX I „ ^ XX. J- - - - « SS - ^ 

195 

v Eh e VaX ^ OX. * - « - 5 ^ ^ 

ASP l VaX « XXe * - - VaX *sn Ma P h e Ser „ Ser 
:l S er Me, * 0X Y T . « - gg vaX Ma Pro OXu G X» 
v Ma X,* vaX 'XXe Ma Ser ~ « Ser - « VaX Se; » VaX 
Thr * P,e Ma 0X„ ^ 0X y Ma X.eu OXu Ma L eu g. - Pro , yS 

ser Lys « P h e vaX f u Met M9 M n Ma £ 0X U „ « « 

290 ^ & 



Asp Thr Ala val GlA Glu Leu Ser Lys He Pro Gly Met Asp Val Val 
305 \ 310 

Lys Pro Glu Gly Ala Phe Tyr Ile Phe Pro Asp Phe Ser Ala Tyr Ala
Glu Lys Leu Gly Gly Asp Val Lys Leu Ser Glu Phe Leu Leu Glu Lys

340 345

Ala Lys Val Ala Val Val Pro Gly Ser Ala Phe Gly Ala Pro Gly Phe

355 360

Leu Arg Leu Ser Tyr Ala Leu Ser Glu Glu Arg Leu Val Glu Gly Ile

370 375

Arg Arg Ile Lys Lys Ala Leu Glu Glu Ile
385 390






